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The present invention relates to DNA sequences encoding novel 
growth/differentiation factors of the TGF-S family. In 
particular, it relates to novel DNA sequences encoding TGF- £- 
like proteins, to the isolation of said DNA sequences, to 
expression plasmids containing said DNA, to microorganisms 
transformed by said expression plasmid, to the production of 
said protein by culturing said transf ormant , and to pharma- 
ceutical compositions containing said protein. The TGF-& 
family of growth factors comprising BMP, TGF, and Inhibin 
related proteins (Roberts and Sporn, Handbook of Experimental 
Pharmacology 95 (1990), 419-472) is of particular relevance 
in a wide range of medical treatments and applications. These 
factors are useful in processes relating to wound healing and 
tissue repair. Furthermore, several members of the TGF-E 
family are tissue inductive, especially osteo-inductive, and 
consequently play a crucial role in inducing cartilage and 
bone development. 

Wozney, Progress in Growth Factor Research 1 (1989) , 267-280 
and Vale et.al., Handbook of Experimental Pharmacology 95 
(1990) , 211-248 describe different growth factors such as 
those relating to the BMP (bone morphogenetic proteins) and 
the Inhibin group. The members of these groups share 
significant structural similarity. The precursor of the 
protein is composed of an aminoterminal signal sequence, a 
propeptide and a carboxyterminal sequence of about 110 
amino acids, which is subsequently cleaved from the precursor 
and represents the mature protein. Furthermore, their members 
are defined by virtue of amino acid sequence homology. The 
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mature protein contains the most conserved sequences 
especially seven cysteine residues which are conserved among 
the family members. The TGF-E-like proteins are 
Afunctional, hormonally active growth factors. T ey also 
share related biological activities such as chemotactio 

„=n differentiation and their 
attraction of cells, promoting cell differentia 

tissue-inducing capacity, such as cartilage- and bone- 
inducing capacity. U.S. Patent No. 5.013.649 discloses DNA 
sequences encoding osteo- inductive proteins termed BMP- 2 
proteins (bone morphogenetic protein), and U.S patent 
applications serial nos. 179 101 and 179 197 disclose the BMP 
proteins BMP-1 and BMP-3. Furthermore, many cell types are 
able to synthesize TGF-S-like proteins and virtually all 
cells possess TGF-6 receptors. 

Taken together, these proteins show differences in their 
structure, leading to considerable variation in their 

ailed biological function, rurthermore they are oun i 
a wide variety of different tissues and developmental stages^ 
Consequently, they might possess differences concen ng their 
function in detail, for instance the required cellular 
physiological environment, their lifespan, their targets 
the element for accessory factors, and their resistance 

o degradation. Thus, although numerous P"»™^^ 
tissue-inductive, especially osteo- inductive potential 
described their natural role in the organism and, more 

im ortanUy, their medical relevance must still be elucr ated 

oi- -i 11 -unknown members or tne 

in detail. The occurrence of still unicno 

TGF-6 family relevant for osteogenesis or 
differentiation/induction of other tissues is strongly 
suspected. However, a major problem in the isolatior of these 

^ c t-hat- f heir functions cannot yet oe 
new TGF-6-like proteins is that their iui 

described precisely enough for the design of a discriminative 
oioas a^ On the other hand, the expected nucleotide sequence 
» known members of the family would be too low to 



allow for screening by classical nucleic acid hybridization 
techniques. Nevertheless, the further isolation and 
characterization of new TGF-fi-like proteins is urgently 
needed in order to get hold of the whole set of induction and 
differentiation proteins meeting all desired medical 
requirements. These factors might find useful medical 
applications in defect healing and treatments of degenerative 
disorders of bone and/or other tissues like, for example, 
kidney and liver. 

Thus the technical problem underlying the present invention 
essentially is to provide DNA sequences coding for new 
members of the TGF-E protein family having mitogenic and/or 
differentiation-inductive, e.g. osteoinductive potential. 

The solution to the above technical problem is achieved by 
providing the embodiments characterized in claims 1 to 17 . 
Other features and advantages of the invention will be 
apparent from the description of the preferred embodiments 
and the drawings. The sequence listings and drawings will now 
briefly be described. 

fi p,r> TP NO. 1 shows the nucleotide sequence of MP-52, i.e. the 
embryo derived sequence corresponding to the mature peptide 
and most of the sequence coding for the propeptide of MP-52. 

Some of the propeptide sequence at the 5' -end of MP-52 has 
not been characterized so far. 

gp.n TP NO. 2 shows the nucleotide sequence of MP-121, i.e. 
the liver derived sequence corresponding to the mature 
peptide, the sequence coding for the propeptide of MP-121, 
and sequences 5' and 3' to the coding region. 



The start codon begins with nucleotide 128 of SEQ ID NO. 2. 
The sequence coding for the mature MP121 polypeptide begins 
with nucleotide 836 of SEQ ID NO. 2. The stop codon begins 
with nucleotide 1184 of SEQ ID NO. 2. The sequence coding for 
the precursor protein has a length of 1056 bp. The sequence 
coding for the propeptide has a length of 708 bp and the 
sequence coding for the mature peptide has a length of 34 8 
bp . 

SEP ID NO. 3 shows the amino acid sequence of MP- 52 as 
deduced from SEQ ID NO. 1. 

SEP ID NO. 4 shows the amino acid sequence of MP- 121 as 
deduced from sequence SEQ ID NO. 2. The sequence of the mature 
polypeptide begins with amino acid 237 of SEQ ID NO. 4. The 
precursor protein has a length of 352 amino acids. The 
propeptide and the mature peptide have a length of 23 6 and 
116 amino- acids, respectively. 

SEP ID NO. 5 shows a part of the nucleotide sequence of the 
liver derived sequence of MP-121. 

SEP ID NO, 6 shows a part of the nucleotide sequence of the 
embryo derived sequence of MP-52. 

The shorter DNA- sequences SEQ ID NO. 5 and 6 can be useful 
for example for isolation of further members of the TGF-S- 
protein family. 

- ^isaurejl shows an alignment of the amino acid sequences of 
MP-52 andMP^24starting from the first of the seven 
conserved cysteines^itji some related proteins. la shows the 
alignment of MP-52 with som^^mbers of the BMP protein 
family; lb shows the alignment of MP^LH with some members of 
the Inhibin protein family. * indicates thatthe amino acid 
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\7is the same in All proteins compared; + indicates that the 
( amino acid is th\ same in at least one of the proteins 
compared with MPA52 (Fig. la) or MP-121 (Fig. lb). 

.iaure 2 shows th\ nucleotide sequences of the oligo- 
^Wde primer as used in the present invention and an 
alignment of these sequences with known members of the TGF-E 
famil M means A or C; S means C orV- R -ans A or G; and K 
means\or T. 2a depicts the sequence of the pr.mer OD; 2b 
shows the\ sequence of the primer OID. 

The presen\ invention relates to novel TGF-S-like proteins 
and provideW sequences contained in the correspond^ 
genes. Such Sequences include nucleotide sequences 

comprising theXsequence 

ATGAAO^CATCGACCCCGAGTCCACA and 
CTTCTCAAGGCCAACACAGCTGCAGGCACC 
and in particular Wnces as illustrated in SEQ ID Nos . 1 
and 2, allelic derivatives of said sequences and DNA 
sequences degenerated^ a result of the genetic code for 
said sequences. They ako include DNA sequences hybr.dxz.ng 
under stringent conditions with the DNA sequences mentioned 
above and containing the\ollowing amino acid sequences: 
Met-Asn-Ser-Met-Asp-p\o-Glu-Ser-Thr or 
Leu-Leu-Lys-Ala-Asn-ThWa-Ala-Gly-Thr. 

Although said allelic, degenerate and hybridizing sequences 
m ay have structural divergencies due to naturally occurring 
mutations, such as small deletions or substitutions they 
*will usually still exhibit essentially the same useful 

properties, allowing their use in basically the same med.cal 
applications. 

According to the present invention, the term "hybridization- 
conventional hybridization conditions, preferably 
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conditions with a salt concentration of 6 x SSC at 62° to 
66°C followed by a one-hour wash with 0.6 x SSC, 0.1% SDS at 
62° to 66°C. The term "hybridization" preferably refers to 
stringent hybridization conditions with a salt concentration 
of 4 x SSC at 62°-66°C followed by a one-hour wash with 0.1 x 
SSC, 0.1% SDS at 62°-66°C. 

Important biological activities of the encoded proteins, 
preferably MP- 52, comprise a mitogenic and osteo- inductive 
potential and can be determined in assays according to 
Seyedin et al . , PNAS 82 (1985), 2267-2271 or Sampath and 
Reddi, PNAS 78 (1981), 7599-7603. 

The biological properties of the proteins according to the 
invention, preferably MP-121, may be determined, e.g., by 
means of the assays according to Wrana et al. (Cell 71, 1003- 
1014 (1992)), Ling et al. (Proc. Natl. Acad, of Science, 82, 
7217-7221 (1985)), Takuwa et al . (Am. J. Physiol., 257, E797- 
E803 (1989)), Fann and Patterson (Proc. Natl. Acad, of 
Science, 91, 43-47 (1994)), Broxmeyer et al . (Proc. Natl. 
Acad, of Science, 85, 9052-9056 (1988)), Green et al . (Cell, 
71, 731-739 (1992)), Partridge et al . (Endocrinology, 108, 
213-219 (1981)) or Roberts et al . (PNAS 78, 5339-5343 
(1981) ) . 

Preferred embodiments of the present invention are DNA 
sequences as defined above and obtainable from vertebrates, 
preferably mammals such as pig or cow and from rodents such 
as rat or mouse, and in particular from primates such as 
humans . 

Particularly preferred embodiments of the present invention 
are the DNA sequences termed MP-52 and MP-121 which are shown 
in SEQ ID Nos . 1 and 2. The corresponding transcripts of MP- 
52 were obtained from embryogenic tissue and code for a 



protein showing considerable amino acid homology to the 
mature part of the BMP- like proteins (see Fig. la) . The 
protein sequences of BMP2 (=BMP2A) and BMP4 (=BMP2B) are 
described in Wozney et al., Science Vol 242, 1528-1534 
(1988) . The respective sequences of BMP 5 , BMP6 and BMP7 are 
described in Celeste et al., Proc .Natl .Acad. Sci . USA Vol 87, 
9843-9847 (1990). Some typical sequence homologies, which are 
specific to known BMP-sequences only, were also found in the 
propeptide part of MP- 52, whereas other parts of the 
precursor part of MP- 52 show marked differences to BMP- 
precursors. The mRNA of MP- 121 was detected in liver tissue, 
and its correspondig amino acid sequence shows homology to 
the amino acid sequences of the Inhibin protein chains (see 
Fig. lb) . cDNA sequences encoding TGF-S-like proteins have 
not yet been isolated from liver tissue, probably due to a 
low abundance of TGF-fi specific transcripts in this tissue. 
In embryogenic tissue, however, sequences encoding known TGF- 
S-like proteins can be found in relative abundance. The 
inventors have recently detected the presence of a collection 
of TGF-£-like proteins in liver as well. The high background 
level of clones related to known factors of this group 
presents the main difficulty in establishing novel TGF-S- 
related sequences from these and probably other tissues. In 
the present invention, the cloning was carried out according 
to the method described below. Once the DNA sequence has been 
cloned, the preparation of host cells capable of producing 
the TGF-&-like proteins and the production of said proteins 
can be easily accomplished using known recombinant DNA 
techniques comprising constructing the expression plasmids 
encoding said protein and transforming a host cell with said 
expression plasmid, cultivating the transformant in a 
suitable culture medium, and recovering the product having 
TGF-S-like activity. 
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Thus, the invention also relates to recombinant molecules 
comprising DNA sequences as described above, optionally- 
linked to an expression control sequence. Such vectors may be 
useful in the production of TGF-S-like proteins in stably or 
transiently transformed cells. Several animal, plant, fungal 
and bacterial systems may be employed for the transformation 
and subsequent cultivation process. Preferably, expression 
vectors which can be used in the invention contain sequences 
necessary for the replication in the host cell and are 
autonomously replicable. It is also preferable to use vectors 
containing selectable marker genes which can be easily 
selected for transformed cells. The necessary operation is 
well-known to those skilled in the art. 

It is another object of the invention to provide a host cell 
transformed by an expression plasmid of the invention and 
capable of producing a protein of the TGF-6 family. Examples 
of suitable host cells include various eukaryotic and 
prokaryotic cells, such as E. coli, insect cells, plant 
cells, mammalian cells, and fungi such as yeast. 

Another object of the present invention is to provide a 
protein of the TGF-S family encoded by the DNA sequences 
described above and displaying biological features such as 
tissue-inductive, in particular osteo-inductive and/or 
mitogenic capacities possibly relevant to therapeutical 
treatments. The above-mentioned features of the protein might 
vary depending upon the formation of homodimers or 
heterodimers . Such structures may prove useful in clinical 
applications as well. The amino acid sequence of the 
especially preferred proteins of the TGF-£-family (MP-52 and 
MP-121) are shown in SEQ ID NO. 3 and SEQ ID NO. 4. 

It is a further aspect of the invention to provide a process 
for the production of TGF-£-like proteins. Such a process 
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v^ot- ppII being transformed with a 
comprises cultivating a host cell Dei g culture 
* „f the nresent invention in a suitable culture 
DNA sequence of the present x 

. * • Trr R-iike protein produced. Thus, 

of he desired protein for use in -did treatments or in 
plications using cell ™ ti ^^T^LST 

£ actors t°r^\Z ™^° * ^ ChU C ° U ' £r0 " 
from bacteria such as Bacillus or 

fungi such as yeast, fro, plants such as tobacco, P"£o^or 
l/abidopsis, and fro. animals, in particular vertebrate cell 
lines such as the Mo-, COS- or CHO cell Ime. 

- «f t-hP nresent invention is to provide a 
Yet another aspect of the present i 

particulars sensitive prooess for the isolation o 

sequences corresponding to low abundance mRHAs m 

of interest The process of the invention composes the 

of *l .»,.. « r » 

h. isolated and used in an amplification reaction using 
Tli nucleotide primers. The seance of ^. oligonucleot, e 
primers contains degenerated DBA seances derived fro. 
Lino acid seance of proteins related to the ge ne of 
interest. This step .ay lead to the amplification of a ready 
k nown members of the gene family of interest and these 
undesired sequences would therefore have to be f^f' 
This object is achieved by using restriction endonucleases 
„h h a e *„own to digest the already-analyzed 
' ge e family. After treatment of the amplified »A popu a on 
lith said restriction endonucleases, the remaining desired 
DNA sequences are isolated by gel electrophoresis and 
Z, lified in a third step by an ^^"f " v "or 

one embodiment of this process. 
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according to the invention include furthermore treatment or 
prevention of connective tissue, skin, mucous membrane, 
endothelial, epithelial, neuronal or renal defects, use in 
the case of dental implants, use as a morphogenic factor used 
for inducing liver tissue growth, induction of the 
proliferation of precursor cells or bone marrow cells, for 
maintaining a differentiated state and the treatment of 
impaired fertility or for contraception. 

Finally, another object of the present invention is : an 
antibody or antibody fragment, which is capable of 
specifically binding to the proteins of the present 
invention. Methods to raise such specific antibody are 
general knowledge. Preferably such an antibody is a 
monoclonal antibody. Such antibodies or antibody fragments 
might be useful for diagnostic methods. 

The following examples illustrate in detail the invention 
disclosed, but should not be construed as limiting the 
invention . 

Example 1 
Isolation of MP-121 

1.1 Total RNA was isolated from human liver tissue (40-year- 
old-male) by the method of Chirgwin et al., Biochemistry 
18 (1979), 5294-5299. Poly A + RNA was separated from 
total RNA by oligo (dT) chromatography according to the 
instructions of the manufacturer (Stratagene Poly (A) 
Quick columns) . 

1.2 For the reverse transcription reaction, poly A + RNA (1- 
2.5 fig) derived from liver tissue was heated for 5 
minutes to 65°C and cooled rapidly on ice. The reverse 
transcription reagents containing 27 U RNA guard 




(Pharmacia), 2 . 5 /ig oligo d(T) 12 . 18 (Pharmacia) 5 x 
buffer (250 mM Tris/HCl pH 8.5; 50 mM MgCl 2 ; 50 mM DTT; 
5 mM each dNTP; 600 mM KCl) and 20 units avian 
myeloblastosis virus reverse transcriptase (AMV, 
Boehringer Mannheim) per /zg poly (A + ) RNA were added. 
The reaction mixture (25 pi) was incubated for 2 hours 
at 42°C. The liver cDNA pool was stored at -20°C. 

1.3 The deoxynucleotide primers OD and OID (Fig. 2) designed 
to prime the amplification reaction were generated on an 
automated DNA-synthesizer (Biosearch) . Purification was 
done by denaturating polyacrylamide gel electrophoresis 
and isolation of the main band from the gel by 
isotachophoresis . The oligonucleotides were designed by 
aligning the nucleic acid sequences of some known 
members of the TGF-& family and selecting regions of the 
highest conservation. An alignment of this region is 
shown in Fig. 2. In order to facilitate cloning, both 
oligonucleotides contained EcoR I restriction sites and 
OD additionally contained an Nco I restriction site at 
its 5 ' terminus . 

1.4 In the polymerase chain reaction, a liver-derived cDNA 
pool was used as a template in a 50 ill reaction mixture. 
The amplification was performed in 1 x PCR-buffer (16.6 
mM (NH 4 ) 2 S0 4 ; 67 mM Tris/HCl pH 8.8; 2 mM MgCl 2 ; 6 . 7 /xM 
EDTA; 10 mM S-mercaptoethanol ; 170 fig/ml BSA (Gibco) ) , 
200 fiM each dNTP (Pharmacia) , 30 pmol each 
oligonucleotide (OD and OID) and 1.5 units Taq 
polymerase (AmpliTaq, Perkin Elmer Cetus) . The PCR 
reaction contained cDNA corresponding to 30 ng of poly 
(A + ) RNA as staring material. The. reaction mixture was 
overlayed by paraffine and 40 cycles (cycle 1: 80s 
93°C/40s 52°C/40s 72°C; cycles 2-9: 60s 93°C/40s 
52°C/40s 72°C; cycles 10-29: 60s 93°C/40s 52°C/60s 
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72-C- cycles 30-31. 60s 93°C/40s 52-C/90S 12-C: cycle 
4 „ . 93-C/40. S2-C/420S 72-C) of the PCR were 

performed. Six PCR-reaction mixtures were pooled 
Turned by subsequent extractions with eoual volumes of 

phenol, phenol/chloroform (1=1 (v/v) ) and 
pnenoj., f ,„„ h ol 124-1 (v/v)) and concentrated 

chloroform/isoamylalcohol (24.1 iv, 

by ethanol precipitation. 

one half of the obtained PCR pool was sufficient for 

tion with the restriction enzymes Sph ! «P armacra, 
and AlwN 1 (Biolabs, . The second half was digested n 
series of reactions by the restriction enzymes Ava 
,BRL) AlwN I (Biolabs) and Tf i I (Biolabs) . The 
"lotion endonuclease digestions were performed rn 
oo ,1 at ,7-C (except Tfi 1 at using 8 unit, of 

■„ a 9 to 12-hour reaction in a buffer 
each enzyme m a 2- to n «<->u 

recommended by the manufacturer. 

Bach DNA sample was fractioned by ^-"^f £ 
a 4% agarose gel (3% FMC Nusieve agarose, Biozym and 
Lrose, BRL) in Tris borate buffer (89 mM Trisbase 89 
agarose, ethidiumbromide 
mM boric acid, 2 mM EDTA, pti oi 

a am Hfiration products (about 200 bp, 
staining uncleaved amplification p 

^rallel) were excised from tne 
c,ize marker was run m parallel 

l and isolated by phenol extraction, an 
ot phenols was added to the excised agarose, w hl ch was 
mJced to small pieces, frozen for 10 ™* 
and centrifuged. The agueous phase 
interphase reextracted by the same 

centrifuged and both aqueous phases were combined DNA 
.as further purified twice by phenol/chloroform and once 
by chloroform/isoamylalcohol extraction. 

X 7 After ethanol precipitation, one fourth or one fifth of 
the isolated DNA was reamplified using the same 
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touN I and sph ■ w identrcal n ^ ^ , 

^ Tnhibin SA se4 found by 

and mm MP -i21, were ,„ 19 , containing 

differed from this ^ paction and 

cleotide exchanges- L gat perfo rmed as 

nucleoli coli HB ioi c loning: A 

transformation - ^ ^ Molecu lar c by 

des cribed » Sambr _ Transformers " 

-^"rn Tesls -e and ^^too* et .1 ■ 
MPi "din g - -anaard P^^^ the doubXe- 

according t done by sequ 

U 989)). Analyse ^ „ dideoxyr ibonucleotide 

, of , niasmids by Q1 sequencing Kit 

stranded pi ing „ with the seq ical 

termination sequ (Unit ed States b 

nase version 2.0 
"Sequenase 

Corporation) - 
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The clone was completed to the 3' end of the c-DNA by a 
method described in detail by Frohman (Amplifications., 
published by Perkin-Elmer Corporation, issue 5 (1990) , 
pp l\-15) . The same liver mRNA which was used for the 
isolation of the first fragment of MP-121 was reverse 
transcribed using a primer consisting of oligo dT (16 
residuesx) linked to an adaptor primer 

(AGAATTCGCATGCCATGGTCGACGAAGC (T) ^ 6 ) . Amplification was 
performed iising the adaptor primer 

(AGAATTCGCATGCCATGGTCGACG) and an internal primer 
(GGCTACGCCATGAACTTCTGCATA) of the MP-121 sequence. The 
amplification products were reamplif ied using a nested 
internal prime r\ ( AC AT AG C AGG C ATGC C TGG T ATTG ) of the MP-121 
sequence and theNadaptor primer. The reamplif ication 
products were clonted after restriction with Sph I in the 
likewise restricted\vector pT7/T3 U19 (Pharmacia) and 
sequenced with the sequencing kit "Sequenase Version 
2.0" (United States Biochemical Corporation). Clones 
were characterized by their sequence overlap to the 3 1 
end of the known MP-121 Sequence . 

One clone, called pl21Lt 3' MP13, was used to isolate a 
Ncol (blunt ended with T4 polymerase ) /SphI fragment. 
This fragment was ligated into a pSK MP-121 (OD/OID) 
vector, where the OD primer sequence was located close 
to the T7 primer sequence of the pSK+ multiple cloning 
site, opened with Sphl/Smal. The resulting plasmid was 
called pMP121DFus6. It contains MP-121 specific sequence 
information starting from position 922 and ending with 
position 1360 of SEQ ID NO. 2. 

Using a Ddel fragment of pMP-121DFus6 as a probe, 
ranging from nucleotide 931 to nucleotide 1304 of SEQ ID 
NO. 2, a human liver cDNA library (Clontech, # HL3006b, 
Lot 36223) was screened by a common method described in 
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detail by Ausubel et al. (Current Protocols in Molecular 
Biology, published by Greene Publishing Associates and 
Wiley- Interscience (1989)). From 8.1 x 10 5 phages, 24 
mixed clones were isolated and re-screened using the 
Ddel fragment. 10 clones were confirmed and the EcoRI 
fragments subcloned into Bluescript SK (Stratagene, # 
212206). EcoRI restriction analysis showed that one 
clone (SK121 L9 . 1 , deposited by the DSM (#9177) has an 
insert of about 2.3 kb. This clone contains the complete 
reading frame of the MP121 gene and further information 
to the 5' and 3 1 end in addition to the sequence 
isolated from mRNA by the described amplification 
methods. The complete sequence of the EcoRI insert of 
SK121 L9 . 1 is shown in SEQ ID NO. 2. The reading frame of 
the MP-121 gene could be confirmed by sequencing of 
another clone (SK121 Lll.l), having the identical 
reading frame sequence as SK121 L9 . 1 . The beginning of 
the start codon of the MP-121 sequence of SK121 L9 . 1 
could be determined at position 128 of SEQ ID NO. 2, 
since there are three stop codons in- frame in front of 
the start codon at positions 62, 77 and 92. The start 
site of the mature MP-121 is at position 836 of SEQ ID 
NO. 2 in sequence analogy to other members of the TGF-J5- 
family, corresponding to amino acid 237 in SEQ ID NO . 4 . 
The stop codon is at position 1184 of SEQ ID NO. 2. 

Plasmid SK121 L9 . 1 was deposited under number 9177 at 
DSM (Deutsche Sammlung von Mikroorganismen und 
Zellkulturen) , Mascheroder Weg lb, Braunschweig, on 
26 . 04 . 94) . 
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Example 2 
Isolation of MP-52 

A further cDNA sequence, MP-52, was isolated according to the 
above described method (Example 1) by using RNA from human 
embryo (8-9 weeks old) tissue. The PCR reaction contained 
cDNA corresponding to 20 ng of poly (A + ) RNA as starting 
material. The reamplif ication step was repeated twice for 
both enzyme combinations. After ligation, 24 clones from each 
enzyme combination were further analyzed by sequence 
analysis. The sample resticted by AlwN I and Sph I yielded a 
new sequence which was named MP-52. The other clones 
comprised mainly BMP6 and one BMP7 sequence. The sample 
restricted by Ava I, AlwN I and Tfi I contained no new 
sequences, but consisted mainly of BMP 7 and a few Inhibin SA 
sequences . 

h^ clone was completed to the 3 1 end according to the above 
described method (Example 1). The same embryo mRNA, which. was 
used f^r the isolation of the first fragment of MP-52, was 
reverse \ranscribed as in Example 1. Amplification was 
performed \sing the adaptor primer (AGAATTCGCATGCCATGGTCGACG) 
and an internal primer (CTTGAGTACGAGGCTTTCCACTG) of the MP-52 
sequence. The amplification products were reamplif ied using a 
nested adaptor primer ( ATTCGCATGCCATGGTCGACGAAG) and a nested 
internal primer (G^SAGCCCACGAATCATGCAGTCA) of the MP-52 
sequence. The reampl\f ication products were cloned after 
restriction with Nco l\in a likewise restricted vector (pUC 
19 (Pharmacia #27-4951-0\) with an altered multiple cloning 
site containing a unique N^p I restriction site) and 
sequenced. Clones were characterized by their sequence 
overlap to the 3' end of the Icnown MP-52 sequence. Some of 
these clones contain the last 1^3 basepairs of the 3 1 end of 
the sequence shown in SEQ ID NO: T\and the 0,56 kb 3' non 
translated region (sequence not shown) . One of these was used 
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a^G a probe to screen a human genomic library (Stratagene 
#94s£203) by a common method described in detail by Ausubel et 
al . XCurrent Protocols in Molecular Biology, published by 
GreeneV publishing Associates and Wiley- Interscience (1989) ) . 
From 8x\p 5 X phages one phage (X 2.7. 4) which was proved to 
contain am insert of about 20 kb, was isolated and deposited 
by the DSm\(#7387) . This clone contains in addition to the 
sequence isolated from mRNA by the described amplification 
methods sequence information further to the 5 1 end. For 
sequence analysas a Hind III fragment of about 7,5 kb was 
subcloned in a likewise restricted vector (Bluescript SK, 
Stratagene #2122o\) . This plasmid, called SKL 52 (H3) MP12, 
was also deposited \by the DSM (# 7353). Sequence information 
derived from this crone is shown in SEQ ID NO: 1. At 
nucleotide No. 1050, Vhe determined cDNA and the respective 
genomic sequence diffeV by one basepair (cDNA: G; genomic 
DNA: A) . We assume the grenomic sequence to be correct , as it 
was confirmed also by sequencing of the amplified genomic DNA 
from embryonic tissue whi^h had been used for the mRNA 
preparation. The genomic DNA contains an intron of about 2 kb 
between basepairs 33 2 and 3 313 of SEQ ID NO: 1. The sequence 
of the intron is not shown. The correct exon/exon junction 
was confirmed by sequencing am amplification product derived 
from cDNA which comprises this\region. This sequencing 
information was obtained by the\help of a slightly modified 
method described in detail by Frohman (Amplifications, 
published by Perkin-Elmer Corporation, issue 5 (1990) , pp 11- 
15) . The same embryo RNA which wasYused for the isolation of 
the 3 1 end of MP-52 was reverse transcribed using an internal 
primer of the MP-52 sequence oriented, in the 5' direction 
(ACAGCAGGTGGGTGGTGTGGACT) . A polyA taM was appended to the 
5 1 end of the first strand cDNA by using terminal 
transferase. A two step amplification wafe performed first by 
application of a primer consisting of oligo dT and an adaptor 
primer; ( AGAATTCGCATGCCATGGTCGACGAAGC (T^ 6 ) ) \and secondly an 
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■jadaptor primer (AGAATTCGCATGCCATGGTCGACG) and an internal 
>rimer (CCACSCAGCCCATCCTTCTCC) of the MP-52 sequence. The 
% amplification^ products were reamplified using the same 
adaptor primerv and a nested internal primer 
(TCCAGGGCACTAATGTCAAACACG) of the MP-52 sequence. 
Consecutively the reamplif ication products were again 
reamplified using\a nested adaptor primer 
(ATTCGCATGCCATGGTC^ACGAAG) and a nested internal primer 
(ACTAATGTCAAACACGTAQCTCTG) of the MP-52 sequence. The final 
reamplif ication products were blunt end cloned in a vector 
(Bluescript SK, StrataVene #212206) restricted with EcoRV. 
Clones were characterized by their sequence overlap to the 
D NA of JL 2,7.4 . 

Plasmid SKL 52 (H3) MP12 was deposited under number 7353 at 
DSM (Deutsche Sammlung von Mikroorganismen und Zellkulturen) , 
Mascheroder Weg lb, 3300 Braunschweig, on 10.12.1992. 



Phage A 2.7.4. was deposited under number 7387 at DSM on 
13 .1.1993 . 



